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SPECIFICATION 

Electronic Version 1.2.8 
Stylesheet Version 1 .0 

SYSTEM AND METHOD FOR 
IMPROVED SYNCHRONOUS 
DATA ACCESS 

Background of the Invention 

[0001] The present invention relates generally to accessing data from a synchronous data 
source and more particularly to compensating for skew in signal paths to and from a 
synchronous data source. 

[0002] 

A major concern in the design and manufacture of electronic devices is the 
latency, or skew, between the transmission and reception of a signal resulting from 
the physical characteristics of the device and/or the transmission medium. Skew is of 
particular concern in the development of synchronized systems utilizing a common 
clock signal. Skew in the clock signal paths, the data signal paths, and/or the control 
signal paths in synchronized systems, if significant and/or variable, can result in an 
erroneous operation of the synchronous system. To illustrate, known synchronous 
memory access systems typically include a memory controller utilized to read data 
from a synchronous memory device, such as a synchronized dynamic random access 
memory (SDRAM). To access data from the memory matrix of the synchronous 
memory device, the memory controller typically provides address and control 
information generated by a command generation module to the memory matrix via a 
control path that generally includes an off-chip driver (OCD), an address/control 
interconnect, and an input receiver (IR). Likewise, a clock signal generated by a clock 
generator (denoted as an original clock signal) is provided via a clock signal path that 
typically includes an OCD, a clock interconnect, and an IR. Based on the 
address/control information, the memory device retrieves the requested stored data 
from the memory matrix and provides a data signal representative of the stored data 
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to an input sampling module of the memory controller via a data path that generally 
includes an OCD, a data interconnect, and an IR. The input sampling module samples 
the data signal using the original clock signal to latch the stored data, processes the 
data as appropriate, and then provides the data to its appropriate destination. 

[0003] Synchronous memory devices, such as the above memory device, often are 

implemented to minimize or avoid the overhead (e.g., handshaking and wait-states) 
common to asynchronous devices. By using a common clock signal and a 
predetermined memory access latency, the memory controllers generally can send a 
request for data to a memory device and then sample the output from the memory 
device after a predetermined number of clock cycles to obtain the stored data from 
the memory device. For example, assume that the time between the receipt of a read 
request by the memory device and the output of the requested data by the memory 
device is one clock cycle (this time herein referred to as the memory access latency). 
The input sampling module, knowing that the requested data will be on the data 
interconnect no later than one cycle after a read request is transmitted, can sample 
the data signal on the data interconnect at the next rising edge (i.e., one cycle later) to 
obtain the requested data from the memory device. 

[0004] However, it will be appreciated that the skew in the signal paths of many such 

systems can be relatively large and/or can vary greatly compared to the clock signal 

period. In addition to on-chip delays, skew can result from, for example, the 

operation of off-chip drivers (OCDs) that are used to drive their respective signals on 

interconnects to input receivers (IRs). Further, skew may result from the transmission 

of electrical signals over the physical mediums (e.g., printed circuit board (PCB) traces) 

of the interconnects. Delays introduced by an OCD, an IR, and an interconnect are 

herein referred to as t , t , and t , respectively. 

OCD IR PCB Y 

[0005] 

Although the delays resulting from each of the OCDs, IRs, and interconnects 
sometimes are not large enough to individually affect the operation of the input 
sampling module of the memory controller, the sum of their delays often can result in 
timing problems, thereby resulting in sampling errors, even at relatively slow clocking 
frequencies. Likewise, although the variance in the delay of each of the OCDs, IRs, and 
interconnects may be relatively insignificant, the resulting combination of variances of 
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the elements of a signal path also can result in sampling errors. 

[0006] To illustrate, assume that the original clock signal is transmitted over the clock 

signal path (i.e., an OCD, a clock interconnect, and an IR) in the known system 

described above. Accordingly, the skew (t ) between the original clock signal 

skew 

generated by the clock generator and the clock signal received by the memory matrix 

is substantially equivalent to the sum of the individual delays of the OCD, the IR, and 

the clock interconnect, ort = t + t + t . If this sum (t ) has a 

skew OCD IR PCB skew 

possible range that exceeds the clock cycle time or varies considerably during 
operation, the input sampling module of the memory controller typically cannot 
reliably know at which clock edge to sample the data signal from the memory device. 

[0007] Accordingly, the cycle time of the original clock signal often is enlarged to 

minimize or prevent unreliable signal sampling by the input sampling module. 

Different memory systems have different associated delays. For example, a DDR 

SDRAM memory has an internal delay lock loop (DLL) compensating for its IR and OCD 

delay. When using such memory, the uncompensated delays result from the OCD of 

the memory controller, the clock signal transmission time over the clock interconnect, 

data signal transmission time over the data interconnect, and the memory controller's 

IR delay. For such systems, when using a single synchronous clock, the clock cycle 

time (T) must be greater than the sum of the above elements (i.e., T > t + t + 
y OCD IR 

2t ). Of course, in order to guarantee reliable operation other factors should also 
PCB 

be taken into account when doing the time budgeting, such as clock period 
uncertainty, PCB related jitter, setup requirements, etc. 

[0008] 

Another important factor to be considered is the difference between clock and 
data loading. A clock signal typically goes to all memory devices. The data lines are 
typically less loaded than the clock line, because several thin memory components can 
be used to assemble a wide memory system. In high-speed memories, the data lines 
are usually point-to-point, whereas the clock net is loaded by several memory 
devices. Therefore, the uncertainty in the OCD delay and interconnect transmission 
line for the data lines typically is smaller than the uncertainty for the clock line (also 
for the control lines). For stub series terminated logic (SSTL) class II drivers (one 
example of an OCD) 7 the t QCD typically can vary between 0.5 nanosecond (nS) and 3 
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nS, whereas the t value for SSTL class II inputs (one example of an IR) can range 

from O.l nS to 0.9 nS. Likewise, PCB traces (one example of an interconnect) typically 

have a t value that ranges between 75 picoseconds (pS) and 500 pS for lengths 
PCB 

from one-half inch to three inches. Using these exemplary values, the value of (t 

y OCD 

+ t ^ + 2t ) can vary from between 0.75 nS to 4.9 nS. Adding additional factors 
that contribute to timing uncertainty such as setup requirements, clock jitter, etc., 
having a total of l nS, the minimum cycle time for the original clock signal, in this 
example, is at least 5.9 nS, corresponding to a maximum clock frequency of about 
1 70 megahertz (MHz). Since the system clock frequency of many types of computing 
systems, such as communication and graphics systems, exceeds this 1 70 MHz limit, a 
different approach to using synchronous memories is desirable. 

[0009] Accordingly, mechanisms attempting to compensate for the effects of skew while 
preserving high clock frequencies have been developed, a number of which are 
discussed below. One known implementation for minimizing the effects of skew 
utilizes a clock generator that is external to both the memory device and the memory 
controller. In this case, the clock generator provides an original clock signal to both 
the memory device and the memory controller. In the event that the delay in the signal 
path between the clock generator and the memory controller is substantially 
equivalent to the delay in the signal path between the clock generator and the 
memory controller, the memory device and the memory controller typically would be 
synchronized to the same clock signal, effectively negating the OCD, IR, and 
interconnect delays of the clock signal path. However, these same delays exist in the 
address/control path and the read data path, thereby causing timing errors at the 
memory device due to the relative skew between the address/control/read signals and 
the common clock signal. As a result, uncertainty is introduced in the sampling of the 
control and/or address signals by the memory device as a result of varying or 
unknown skew in the control/address signals. Likewise, variable and/or unknown 
skew in the read data path between the memory device and the memory controller 
also introduces uncertainty in the sampling of read data from the memory device by 
the memory controller. 



[0010] 



Another known system for minimizing the effects of skew implements a phase 
lock loop (PLL) and an additional IR. In this known implementation, the original clock 
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signal from the clock generator of the memory controller is provided to the PLL as a 
reference clock signal and the output clock signal to the memory is supplied as 
feedback to the PLL via a signal path that simulates the clock signal path between the 
memory controller and the memory device. 

[001 1] Using this feedback clock signal, the PLL typically attempts to ensure that the 
phase of the skewed clock signal used by the memory device and the original clock 
signal used by the input sampling module are the same by providing a corrected clock 
signal, effectively negating the OCD, IR, and interconnect delays for the clock signal 
path. However, as with the above known implementation, delays still exist in the 
address/control path and the read data path, thereby causing timing errors at the 
memory device due to the relative skew between the address/control/read signals and 
the corrected clock signal. Advancing the clock frequency by the OCD, IR, and 
interconnect delays to compensate for the relative skew typically fails since the 
sampling timing at the input of the memory device then is misadjusted, as are the 
address and control signals, resulting in uncertainty in sampling of the address 
and/or control signals. 

[001 2] Yet another known implementation utilizes resampling in an attempt to overcome 
problems resulting from clock skew. In this case, the memory controller of the known 
system typically includes a resampling module to resample the output from the input 
sampling module of the memory controller using the original clock signal. As with the 
above known implementation, an additional IR generally is utilized to provide a 
skewed clock signal to the input sampling module that is substantially equivalent to 
the skewed clock signal received by the memory device. Accordingly, the input 
sampling module can more reliably sample the data signal provided by the memory 
device using the skewed clock signal (disregarding the delays introduced over the data 
signal path). 

[0013] 

While the data can be more reliably sampled by the input sampling module using 
the skewed clock signal, it will be appreciated, however, that the data is sampled in 
the skewed clock domain rather than in the original clock domain used by the 
remainder of the memory controller. Accordingly, known solutions implement the 
resampling module to resample the once-sampled data to convert the data from the 
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skewed clock domain to the original clock domain using the original clock signal as 
the timing reference. While this generally is successful when the signal delays and/or 
variance are relatively minor, the resampling module experiences the same unreliable 
sampling issues when the control signal delays are as great or greater than the time 
period of the clock signal due to the inherent delays in the control signal path, since it 
may be unclear as to which read command a set of sampled data is associated. When 
this delay is relatively significant, it can affect the timing of the resampling module 
and, therefore, cause any resampled data to be unreliable. 

[0014] Accordingly, a system and a method for improved synchronization during a data 
access from a synchronous data source would be advantageous. 

Summary of the Invention 

[001 5] The present invention mitigates or solves the above-identified limitations in 
known solutions, as well as other unspecified deficiencies in known solutions. A 
number of advantages associated with the present invention are readily evident to 
those skilled in the art, including economy of design and resources, greater system 
performance, flexibility, cost savings, etc. 

^ ' In accordance with one embodiment of the present invention, a data requestor for 

obtaining requested data from a synchronous data source is provided, wherein the 
data requestor transmits for reception by the data source an original clock signal and 
an original control signal, which is representative of a data request, and the data 
source receives a delayed version of the original clock signal resulting from a delay in 
a clock signal path between the data requestor and the data source and a delayed 
version of the original control signal resulting from a delay in a control signal path 
between the data requestor and the data source. The data requestor comprises a 
skewed clock signal generator adapted to generate a skewed clock signal that is 
substantially equivalent to the delayed version of the original clock signal and a 
skewed control signal generator adapted to generate a skewed control signal that is 
substantially equivalent to the delayed version of the original control signal. The data 
requestor further comprises an input sampling module for receiving the skewed clock 
signal and the skewed control signal and being adapted to sample, using the skewed 
clock signal and the skewed control signal, a data signal to obtain the requested data, 
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wherein the data signal is representative of the requested data and is provided by the 
data source based at least in part on the delayed versions of the original clock signal 
and the original control signal. 

[001 7] In accordance with another embodiment of the present invention, an apparatus for 
synchronizing a data requestor with a data source during a provision of requested 
data is provided, the data requestor transmitting for reception by the data source an 
original control signal representative of a data request and an original clock signal 
and the data source receiving delayed versions of the original clock signal and the 
original control signal resulting from a delay in a signal path between the data 
requestor and the data source. The apparatus comprises an input sampling module 
having at least one input and being adapted to sample a data signal to obtain the 
requested data using a skewed clock signal substantially equivalent to the delayed 
version of the original clock signal and a skewed control signal that is substantially 
equivalent to the delayed version of the original control signal, wherein the data signal 
is representative of the requested data and is provided by the data source based at 
least in part on the delayed versions of the original clock signal and the original 
control signal. The input sampling module is further adapted to perform at least one 
process operation on the requested data based at least in part on process information 
associated with the requested data. The apparatus further comprises means for 
providing the process information to the input sampling module synchronously with 
the skewed clock signal and the skewed control signal. 

[001 8] | n accorc | ance with yet another embodiment of the present invention, a memory 
controller for obtaining stored data from a synchronous memory device is provided. 
The memory controller transmits for reception by the memory device a first clock 
signal and a first control signal representative of a data request and the memory 
device receives a delayed version of the first clock signal resulting from a delay in a 
clock signal path between the memory controller and the memory device and a 
delayed version of the first control signal resulting from a delay in a control signal 
path between the memory controller and the memory device. The memory controller 
comprises a clock signal skewing circuit for communicating the first clock signal over 
a clock signal skew path having a delay substantially equivalent to the delay of the 
clock signal path, whereby the clock signal skewing circuit delivers a second clock 
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signal representative of the delayed version of the first clock signal and a control 
signal skewing circuit for communicating the first control signal over a control signal 
skew path having a delay substantially equivalent to the delay of the control signal 
path, whereby the control signal skewing circuit delivers a second control signal 
representative of the delayed version of the first control signal. 

[001 9] The memory controller further comprises a first dual clock FIFO buffer in electrical 
communication with the clock signal skewing circuit and being adapted to store, using 
the first clock signal, process information associated with the stored data, and output, 
using the second clock signal, the process information. The memory controller 
additionally comprises an input sampling module in electrical communication with the 
clock signal skewing circuit and the first dual clock FIFO buffer, the input sampling 
module being adapted to sample a data signal based at least in part on the second 
clock signal and the second control signal, wherein the data signal is representative of 
the stored data and js provided by the memory device based at least in part on the 
second clock signal and second control signal, obtain the process information from 
the first dual clock FIFO buffer based at least in part on the second clock signal and 
the second control signal, and perform at least one process operation on the stored 
data based at least in part on the process information. 

[0020] In accordance with yet another embodiment of the present invention, a method for 
synchronizing a data requestor with a data source during a transfer of requested data 
is provided. The method comprises the steps of generating a skewed clock signal 
approximating a delayed version of an original clock signal communicated to the data 
source, the delayed version resulting at least in part from delay associated with a 
clock signal path over which the original clock signal is transmitted between the data 
requestor and the data source, and generating a skewed control signal approximating 
a delayed version of an original control signal communicated to the data source, the 
delayed version resulting at least in part from delay associated with a control signal 
path over which the original control signal is transmitted between the data requestor 
and the data source. The method further comprises the step of sampling, using the 
skewed clock signal and skewed control signal, a data signal received by the data 
requestor from the 'data source to obtain the requested data. 
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[0021] In accordance with an additional embodiment of the present invention, a method 

for synchronizing a memory controller with a synchronous memory device during a 
read access of stored data is provided. The method comprises the steps of sampling, 
using a second clock signal and a second control signal, a data signal provided by the 
memory device to obtain the stored data, the second clock signal being representative 
of a delayed version of a first clock signal received by the memory device from the 
memory controller and the second control signal being representative of a delayed 
version of a first control signal received by the memory device from the memory 
controller and obtaining, using the second clock signal, process information 
associated with the stored data from a first dual clock FIFO buffer. The method further 
comprises the step of performing at least one process operation on the stored data 
based at least in part on the process information. 

[0022] In a digital subscriber line modem comprising a communications processor 

coupled to a synchronous memory device, the communications processor transmitting 
for reception by the memory device an original control signal representative of a data 
request and an original clock signal and the memory device receiving a delayed 
version of the original clock signal resulting from a delay in a clock signal path 
between the communications processor and the memory device and a delayed version 
of the original control signal resulting from a delay in a control signal path between 
the communications processor and the memory device, an apparatus for 
synchronizing an access of the requested data stored in the memory device is 
provided. The apparatus comprises means for generating a skewed clock signal 
substantially equivalent to the delayed version of the original clock signal and means 
for generating a skewed control signal substantially equivalent to the delayed version 
of the original control signal. The apparatus further comprises an input sampling 
module in electrical communication with the means for generating the skewed clock 
signal and the means for generating the skewed control signal and being adapted to 
sample, using the skewed clock signal and the skewed control signal, a data signal to 
obtain the requested data, wherein the data signal is representative of the requested 
data and is provided by the data source based at least in part on the delayed versions 
of the original clock signal and the original control signal. 

[0023] Qne ac j vantage Q f at | east one embodiment of the present invention is improved 
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reliability during the sampling of the data signals from a data source during a data 
access by minimizing the effects of skew between the requestor of the data and the 
source of the data. Another advantage includes allowing the use of synchronous 
memory devices using separate read and write clocks. 

[0024] Still further features and advantages of the present invention are identified in the 
ensuing description, with reference to the drawings identified below. 

Brief Description of the Drawings 

[0025] The purpose and advantages of the present invention will be apparent to those of 
ordinary skill in the art from the following detailed description in conjunction with the 
appended drawings in which like reference characters are used to indicate like 
elements, and in which:Figure 1 is a schematic diagram illustrating an exemplary 
mechanism for compensating for signal skew at a synchronous memory device during 
a read access in accordance with at least one embodiment of the present invention. 

[0026] Figure 2 is a schematic diagram illustrating the exemplary mechanism of Figure 1 
in greater detail in accordance with at least one embodiment of the present invention. 

[0027] Figure 3 is a timing diagram illustrating an exemplary operation of the mechanism 
of Figures 1 and 2 in accordance with at least one embodiment of the present 
invention. 

[0028] Figure 4 is a schematic diagram illustrating an exemplary implementation of a 

memory controller in a digital subscriber line modem in accordance with at least one 
embodiment of the present invention. 

Detailed Description of the Invention 

[0029] 

The following description is intended to convey a thorough understanding of the 
present invention by providing a number of specific embodiments and details 
involving synchronized memory access. It is understood, however, that the present 
invention is not limited to these specific embodiments and details, which are 
exemplary only. It is further understood that one possessing ordinary skill in the art, 
in light of known systems and methods, would appreciate the use of the invention for 
its intended purposes and benefits in any number of alternative embodiments, 



AP'P_1D= 1 0064445 Page 1 0 of 40 



JL OXJiiV ^ ■ - 5-* 5 Si m O > X?5 CLE 

depending upon specific design and other needs. 

[0030] Figures 1-4 illustrate mechanisms for improved synchronization during a data 

access from a synchronous data source. In at least one embodiment, a data requestor 
provides a common clock signal and a control signal to a data source, indicating a 
request for data from the data source. As a result of significant delays present in the 
signal paths between the data requestor and the data source, the reception of the 
common clock signal and the control signal are delayed during transmission, causing 
the timing of the output of the requested data from the data source to be unreliable. 
Accordingly, in at least one embodiment, the data requestor provides skewed clock 
and control signals to its input sampling module to compensate for the delay between 
the data requestor and the data source, where the skewed signals are substantially 
equivalent to the corresponding actual delayed signals received at the data source. 
Using these skewed clock and control signals, the data can be more reliably sampled. 
Additionally, in at least one embodiment, process information, which includes 
implementation-specific information associated with the requested data (e.g., a 
destination address), is generated in the original clock domain but provided to the 
module handling the requested data in the skewed clock domain. In one embodiment, 
a dual clock FIFO is used to store the process information using the common clock 
and the process information can retrieved from the FIFO at the appropriate time using 
the skewed clock signal. In another embodiment, a signal path having a delay 
substantially similar to the control signal path and/or the clock signal path can be 
used to delay the process information so that it is synchronized with the skewed 
control signal. 

[0031] Although the present invention may be implemented in any number of systems 
wherein a data requestor and a data source utilize a common clock to synchronize a 
transfer of requested data between the data source and the data requestor, the 
present invention finds particular benefit in systems implementing a memory 
controller (one embodiment of the data requestor) and a synchronous memory device 
(one embodiment of the data source). While Figures 1 -4 illustrate various 
implementations of a memory controller used to access stored data from a 
synchronous memory device in accordance with various embodiments of the present 
invention, these exemplary illustrated embodiments are not intended to limit the 
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present invention to such implementations. Rather, the present invention may be 
implemented by those skilled in the art in a number of synchronous data access 
mechanisms, using the guidelines provided herein. 

[0032] In the following discussion of Figures 1 -4, reference is made to the timing, 
sampling, and transmission of digital signals between one or more electrical 
components. Those skilled in the art will appreciate that the successful transmission 
and reception of such signals often requires certain signal considerations, including 
set up times, hold times, and the like. However, for the ease of illustration, the 
following discussion disregards these widely known signal considerations. 
Mechanisms for coping with such signal considerations are well known and may be 
implemented by those skilled in the art, using the guidelines provided herein, without 
departing from the spirit or the scope of the present invention. 

[0033] Referring now to Figure 1 , an exemplary memory access system 100 is illustrated 
in accordance with at least one embodiment of the present invention. In the illustrated 
embodiment, the memory access system 100 includes a data requestor adapted to 
request and obtain data from a data source. More particularly, the system 1 00 
includes a memory controller 102 (i.e., the data requestor) connected to a 
synchronous memory device 1 04 (i.e., the data source). The memory controller 1 02 
can serve any type of application, such as microprocessor system (personal computer, 
workstation, etc.), embedded systems, graphics and game systems and the like. 

[0034] In the illustrated exemplary implementation, the memory controller 1 02 includes a 
clock generator 1 06, a command generation module 1 08, a dual clock FIFO 1 1 0, an 
input sampling module 112, off-chip drivers (OCDs) 116-120 and input receivers (IRs) 
1 24-1 28. The memory device 1 04 can include any of a number of synchronous 
memory devices, including SDRAM, SSRAM, synchronous FLASH, synchronous FIFOs, 
programmable logic, etc., as well as other devices. In the illustrated embodiment, the 
memory device 1 04 includes a memory matrix 1 30, IRs 1 32-1 36, and an OCD 1 22. 

[003 5] £ s w j tn k nown memory controllers, the memory controller 1 02 can be adapted to 
obtain stored data from the memory device 104 by providing an original clock signal 
1 50 generated by the clock generator 1 06, an address signal 1 94 generated by the 
command generation module 108, and a control signal 192 (e.g., a read request) 
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generated by the command generation module 1 08 and transmitted to the memory 
device 104 via interconnects 138, 140, and 142, respectively. In turn, the memory 
device 1 04 can be adapted to locate the stored data based at least in part on the 
address, control, and clock information and to provide the requested stored data to 
the memory controller 1 02 in the form of a data signal 1 82 transmitted via the data 
interconnect 1 44. 

[0036] Rather than utilizing the original clock signal 1 50 to sample the data signal to 
extract the stored data, in at least one embodiment, a skewed clock signal 1 90 is 
provided to the input sampling module 1 1 2. The skewed clock signal 1 90, in at least 
one embodiment, has a skew relative to the original clock signal 1 50 that is 
substantially equivalent to the skew in the clock signal path (i.e., the OCD 1 1 6, the 
interconnect 1 38, and the IR 1 32) between the memory controller 1 02 and the 
memory device 104. In other words, the skewed clock signal 190, in this case, is 
intended to simulate or represent the delayed version of the clock signal 1 50 (i.e., 
skewed clock signal 1 58) as received by the memory matrix 1 30. Likewise, a skewed 
control signal 1 76 is provided via IR 1 28 to the input sampling module 1 1 2 for use in 
detecting a read request generated by the command generation module 1 08. Like the 
skewed clock signal 190, the skewed control signal 176 simulates the delayed version 
of the control signal 192 (i.e., the skewed control signal 174) received by the memory 
matrix 1 30 as a result of the transmission of the original control signal 1 92 over the 
control signal path (i.e., the OCD 120, the control interconnect 1 42, and the IR 1 36). 

[0037] By using a skewed clock signal 1 90 that is substantially equivalent (i.e., 

substantially similar skew) to the delayed version of the original clock signal 1 50 (i.e., 
skewed clock signal 1 58) received by the memory matrix 1 30 and a skewed control 
signal 1 76 that is substantially equivalent to the delayed version of the original 
control signal 1 92 (i.e., skewed control signal 1 74) received by the memory matrix 
1 30, the timing of the input sampling module 1 1 2 can be adjusted by an amount 
substantially equivalent to the delay of signal paths between the memory controller 
1 02 and the memory device 1 04. As a result, the input sampling module 1 1 2 is more 
synchronized with the memory device 1 04 and therefore can more reliably sample the 
data signal from the memory device 1 04. 



APP__ID= 10064445 



Page 13 of 40' 



«sl jujus to**-* a ■ & * u jl s slj jci 

[0038] The only remaining potential delays of any significance typically include the t QCD 

of the OCD 122, the t of the data interconnect 144, and the t of the IR 126. 

PCB IR 

However, by shifting the clock signal and the control signal used by the input 
sampling module 1 1 2 by amounts substantially equivalent to the amounts of the skew 
introduced by the clock and control signal paths, respectively, the resulting overall 
skew can be reduced by a substantial amount, even in the presence of skew in the 
data signal path (i.e., the difference between the original data signal 1 82 and the 
skewed data signal 1 88 received by the input sampling module 1 12). For example, 
assuming that the delay of each OCD of the system 1 00 is the same, the delay of each 
IR is the same, and the delay of each interconnect is the same, then the total non- 
compensated skew during a read operation can be reduced by half since the skewed 
clock signal 1 90 and the skewed control signal 1 76, as used by the input sampling 
module 112, compensate for half of the sources of delay. Accordingly, the maximum 
frequency of the clock signal 150 (theoretically) can be increased two-fold as a result 
of reduction by half in the skew, thereby allowing an increase in the read data rate by 
the memory controller 1 02. In the event that the sums of the delays introduced by 
clock and/or control signal paths are greater than the skew in the skewed data signal 
1 88, the frequency of the clock signal 1 50 can be increased even further. 

[0039] 

Any number of mechanisms may be implemented to provide a skewed clock signal 
1 90 that simulates/represents the skewed clock signal 1 58 as received by the memory 
matrix 1 30, as well as a skewed control signal 1 76 that simulates/represents the 
skewed control signal 1 74 as received by the memory matrix 1 30. In one 
embodiment, these representative skewed signals are generated by providing each 
original signal to the input sampling module 1 1 6 over a corresponding signal path 
that is substantially similar to the corresponding signal path between the memory 
controller 1 02 and the memory device 1 04. As illustrated, the original clock signal 
1 50 is provided to the input sampling module 1 1 2 via a signal path having the OCD 
1 1 6, an interconnect 1 46, and the IR 1 24. By comparison, the clock signal path to the 
memory device includes the OCD 1 16, the interconnect 1 38, and the IR 124. In the 
event that the interconnects 1 46, 1 38 have a similar delay and the IRs 1 24, 1 32 have a 
similar delay, then the delays of the original clock signal path and the simulated signal 
path to the input sampling module 1 1 6 are substantially equivalent. As such, the 
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signal path from the OCD 1 1 6 through the IR 1 24 can be assumed to have a skew that 
is substantially equivalent to the signal path from the OCD 1 1 6 through the IR 1 32. 

[0040] Similarly, the original control signal 192 can be provided from the command 
generation module 1 08 to the input sampling module 1 1 2 via a signal path that 
includes the OCD 1 1 8, an interconnect 1 48, and the IR 128, resulting in a skewed 
control signal 1 76 that simulates the skew of the original control signal 1 92 over the 
signal path having the OCD 1 20, the control interconnect 142, and the IR 1 36. 
Although one mechanism for providing a skewed clock and/or control signals that are 
substantially synchronized to corresponding delayed signals received at the memory 
device 1 04, those skilled in the art can develop other mechanisms using the 
guidelines provided herein. 

[0041] Utilizing a clock signal and a control signal that simulate the skewed clock signal 
1 58 and skewed control signal 1 74, respectively, to sample the input data signal 1 88 
can minimize or eliminate the uncertainty in the timing of the sampling. However, as 
discussed above, the sampled data is sampled in the skewed clock domain rather than 
the original clock domain. Accordingly, the uncertainty may shift from the sampling of 
the data read signal to the processing of the sampled data. 

[0042] To illustrate, assume that a first read request is initiated at the rising edge of a 
clock cycle (time t ^ ) of the original clock signal 1 50 when the command generation 
module 108 provides an address signal 194 representative of a memory location and 
a control signal 1 92 representative of a request to obtain a first set of data stored at 
the memory location to the memory device 1 04. Also assume that the memory access 
latency of the memory device 1 04 is predetermined to be one clock cycle of the 
original clock signal 1 50. Additionally, assume that a first location at which the 
requested first set of data is to be stored is provided to the input sampling module 
1 1 2 sometime in the first clock cycle following the time t ^ . Furthermore, in this 
example, a second read request is initiated at the rising edge of the next clock cycle 
(time t ) using the same process for a different set of stored data. Sometime during 
the second clock cycle, a second location is provided for the second set of data. 

[0043] in the aDsence 0 f any skew, the input sampling module 1 1 2 can sample the data 
signal 1 82 from the memory device 1 04 (received via IR 1 26 as data signal 1 88) to 
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obtain the first set of data at the rising edge of the second clock cycle (time t ). 
Using the first location received sometime after t ^ and before t , the input sampling 
module 1 12 can route the first set of data to its intended storage location. Similarly, 
the input sampling module 1 1 2 can sample.the data signal 1 88 at the rising edge of a 
third clock cycle (time t ^ ) to obtain the second set of data. Using the second location 
received from the command generation module 1 08 between time t and time t , 
the input sampling module 1 1 2 can route the second set of data to the second 
storage location. 

[0044] However, when the skewed clock signal 1 90 is used by the input sampling module 
1 1 2 to sample the skewed data signal 1 88 from the memory device 1 04 to minimize 
the effects of clock skew, the first and second sets of data are sampled in the skewed 
clock domain rather than the original clock domain. As a result, it would be difficult if 
not impossible to determine which read request of the sequence of read requests is 
associated with each of the first and second sets of data. Accordingly, in at least one 
embodiment, the command generation module 108 is adapted to provide associated 
process information (represented by signal 160) to a dual clock first in-first out (FIFO) 
buffer 1 1 0, where the FIFO 1 1 0 acts as a buffer to the input sampling module 1 1 2. 
The process information can include any of a variety of implementation-specific 
information, such as the destination locations of data retrieved from the memory 
device 1 04, received data type (such as data/instruction), target host for the data, etc. 

[0045] In at least one embodiment, a dual clock FIFO buffer 1 10 is used as an interface 
between the two different clock domains, whereby the FIFO input operations are 
synchronized to the original clock and the FIFO output operations are synchronized to 
the skewed clock. In at least one embodiment, the process information is written to 
the FIFO 1 1 0 synchronous to the original clock signal 1 50. This information is later 
read from the FIFO 1 1 0 synchronous to the skewed clock signal 1 90. As such, the 
FIFO 1 10 can be viewed as an interface between the original clock domain and the 
skewed clock domain for process information associated with data requested from the 
memory device 1 04. 

[0046] 

By using the FIFO 1 1 0 to store one or more sets of process information for use by 
the input sampling module 1 1 2, a correlation between the read requests by the 
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memory controller 1 02 and the resulting sets of sampled data can be maintained. In 
at least one embodiment, the command generation module 1 08 is adapted to 
generate process information when a read request is generated and transmitted. The 
process information associated with the read request (represented by signal 160) is 
then stored in the FIFO 1 1 0 using the original clock 1 50 and a write enable signal 
162. Based at least in part on the skewed control signal 176 and the skewed clock 
signal 190, the input sampling module 1 12 can determine the appropriate time to 
extract the process information from the FIFO 1 10 for use in processing the 
corresponding data set from the memory device 1 04. When the input sampling 
module 1 12 detects a read request (control signal 176), the input sampling module 
1 1 2 can send a read enable signal 1 78 to the FIFO 1 1 0, directing the FIFO 1 1 0 to 
output the next stored set of process information (represented by signal 1 80). 

[0047] Since the sets of process information are extracted from the FIFO 1 1 0 

synchronously with the sampling of the corresponding data by the input sampling 
module 1 1 2 (due to the skewed clock signal 1 90 and the skewed control signal 1 76), 
the input sampling module 1 12 can correctly associate read data with its 
corresponding process information. By using skewed clock and control signals via the 
IRs 1 24, 1 28 in conjunction with the dual clock FIFO 1 1 0, it will be appreciated that 
the input sampling module 1 1 2 can simulate the timing of the memory device 1 04 by 
using skewed signals that are substantially equivalent to the skewed signals received 
by the memory device 1 04. By simulating the timing, the input sampling module 1 1 2 
can more reliably synchronize the sampling of the data signal provided by the memory 
device 1 04. 

[0048] However, while using the skewed clock signal 190 to synchronize the input 

sampling module 1 1 2 and/or the dual clock FIFO 1 1 0 to the memory device 1 04, it 
will be appreciated that the data output from the input sampling module 1 1 2 is 
sampled in the skewed clock domain and therefore may cause difficulties when 
manipulated by other components of the memory controller 1 02 and/or other devices 
that are operating in the original clock domain (clock signal 1 50). Accordingly, in at 
least one embodiment, the memory controller 102 includes a second dual clock FIFO 
buffer 1 1 4 at the output of the input sampling module 1 1 2 to store output data (as 
well as any process information for used for subsequent processing with logic 
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operating synchronous to the original clock). Just as the dual clock FIFO 1 1 0 can act 
as an interface to pass process information from the original clock domain to the 
skewed clock domain, the dual clock FIFO 1 14 can act as an interface to pass resulting 
data from the skewed clock domain to the original clock domain. In this case, the 
output of the input sampling module 1 1 2 is buffered in the FIFO 1 1 4 based in part on 
the skewed clock signal 1 90 while buffered data can be output from the FIFO 1 1 4 
based in part on the original clock signal 1 50, thereby allowing components 
synchronized to the original clock signal 1 50 to properly utilize the output data from 
the input sampling module 1 1 2. It is also possible to construct a memory controller 
that would not require any process information. For example, if the memory controller 
serves only one host, and the host generates ordered requests and the memory 
controller does not change the order in which requests are handled, no process 
information is required at the controller. On the other hand, if this exemplary memory 
controller would change the order of read requests going to the memory, such FIFO to 
hold the re-ordering information (denoted here as process information) would be 
desirable. < 

[0049] 

Although one mechanism of synchronizing the transfer of process information 
from the command generation module 1 08 to the input sampling module 1 1 2 has 
been illustrated, those skilled in the art can develop other mechanisms, using the 
guidelines provided herein, without departing from the spirit or the scope of the 
present invention. For example, in one embodiment, the process information 
generated by the command generation module 1 08 is provided to the input sampling 
module 1 1 2 via a process signal path having a similar skew as the control signal path 
and/or the clock signal path. The skew of the control signal path/clock signal path 
could be simulated in the process signal path, for example, by transmitting the 
process information to the input sampling module 1 1 2 over a signal path having: an 
OCD having similar properties as the OCD 1 66 or OCD 1 20; an interconnect similar to 
the interconnect 144 or the interconnect 148; and an IR having similar properties as 
the IR 1 32 or IR 1 28. As a result, the process information delayed between the 
command generation module 1 08 and the input sampling module 1 1 2 by a similar 
amount of time as the clock and/or control signals over their respective signal paths. 
Accordingly, using the process signal path having a similar delay as the control/clock 
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signal paths, the process information can be synchronized to the skewed control 
signal 1 76 and/or the skewed clock signal 1 90. 

[0050] Referring now to Figures 2 and 3, an exemplary operation of the system 100 is 
illustrated in accordance with at least one embodiment of the present invention. 
Figure 2 illustrates an exemplary implementation of the input sampling module 1 12. 
In the illustrated embodiment, the input sampling module 1 1 2 includes a timing 
controller 21 2, a flip flop 220 for each data line of the data interconnect 144, and an 
information processing module 230. For ease of illustration, a single flip flop 220 
corresponding to a single data line of the data interconnect 144 is shown in Figure 2. 
The components of the input sampling module 1 1 2 can be implemented as software, 
firmware, hardware, or a combination thereof. 

[0051] In at least one embodiment, the skewed clock signal 1 90 and the skewed control 
signal 1 76 are provided to the timing controller 2 1 2, whereupon the timing controller 
212 is adapted to provide an enable signal to the flip flop 220 based at least in part 
on the skewed clock signal 1 90 and/or the skewed control signal 1 76. For example, 
assume that the memory access latency of the memory device 1 04 (Figure 1 ) is one 
clock cycle. As such, the requested stored data can be made available on the data 
interconnect 1 44 (Figure 1 ) by the memory device 1 04 one clock cycle after the 
memory device 104 receives a read request (as part of the skewed control signal 174). 
In this case, the timing controller 212 can be adapted to send an enable signal to the 
flip flop 220 one cycle after detecting a read request from the skewed control signal 
1 76 to cause the flip flop 220 to latch the requested data on the data signal 1 88 at 
the appropriate time. 

[0052] Because the skewed control signal 1 76 and the skewed clock signal 1 90 include 

delays that are, in one embodiment, substantially equivalent to the delays in the clock 
and control signals received by the memory device 1 04, the operation of the timing 
controller 21 2 may be substantially synchronized with the memory device 1 04. Due to 
this synchronization, the flip flop 220, upon receipt of the output enable signal from 
the timing controller 212, is able to more reliably sample the data signal 1 88 since the 
timing errors resulting from the skew between the memory controller 1 02 and the 
memory device 104 are substantially minimized or eliminated. 
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[0053] The information processing module 230, in at least one embodiment, is adapted 
to perform one or more processes on latched data retrieved from the memory device 
104 based at least in part on the corresponding process information received from the 
FIFO 1 1 0. It will be appreciated that the process information and the operation(s) 
performed based at least in part on the process information are implementation- 
specific and, therefore, any of a variety of processes, or a combination thereof, may 
be implemented without departing from the spirit or the scope of the present 
invention. Such process operations can include routing the retrieved data to one or 
more destinations based on an associated destination address (one embodiment of 
the process information), decrypting the retrieved data based on an associated 
encryption key (another embodiment of the process information), and the like. 
Additionally, as discussed above with reference to Figure 1, in one embodiment, the 
information processing module 230 can be adapted to provide its output to the dual 
clock FIFO 1 1 4 (Figure 1 ) for storage using the skewed clock signal 1 90, whereupon 
the buffered data subsequently can be output to other components of the memory 
controller 102 using the original clock signal 1 50. . 

[0054] Figure 3 is an exemplary timing diagram for illustrating a skew-compensated 

memory access process of system 100 (Figures 1 and 2) in accordance with at least 
one embodiment. In the following example, waveform 350 represents the original 
clock signal 1 50 generated by the clock generator 1 06 and provided to the 
interconnects 1 38, 1 46 via the OCD 1 1 6, which introduces a delay into the clock 
signal. The waveform 352 represents the skewed clock signal as output via the OCD 
1 1 6 at point 1 52 of the interconnects 1 38, 146 and the waveform 345 represents the 
skewed clock signal as received at the input of the IR 1 32 (point 1 54) and the IR 1 24 
(point 1 56), further including the delays resulting from the interconnects, such as PCB 
trace delays. Waveform 3 56 represents the skewed version of the original clock signal 
1 50 as received by the memory matrix 1 30 (as skewed clock signal 1 5 8) and the input 
sampling module 1 1 2 (as skewed clock signal 190), where the delay between the 
original clock signal 1 50 (waveform 3 50) and the skewed clock signals 158, 190 
(waveform 3 56) is a result of the delays introduced by the respective control signal 
paths. In this example, it is assumed that that the delay of each control signal path is 
the same. 
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[0055] In response to each of the rising edges of the original clock signal 1 50, the 
command generation module 108, in this example, generates a read request 
(commands 3 1 1 -31 6) as part of the control signal 1 92 to direct the memory device 
1 04 to output data located at the address of the memory matrix 1 30 provided by the 
memory controller 1 02 via the address signal 1 94. For illustrative purposes, the 
following discussion focuses on the access of data from the memory device 130 
resulting from the second read request (command 31 2). 

[0056] In addition to generating commands 311-316 and transmitting them to the 

memory device 1 04 via the command signal path, the command generation module 
108, in one embodiment, generates and provides process information 321-326 
(waveform 362) to the FIFO 1 1 0 and asserts the write enable signal 1 60 to direct the 
FIFO 1 1 0 to store the corresponding process information. To illustrate, the command 
generation module 108 generates process information 322 (e.g., a destination 
address) corresponding to the data requested by the command 312 and asserts the 
write enable signal 160 (waveform 360) at the second rising edge of the original clock 
signal 1 50 (waveform 3 50) to direct the FIFO 1 1 0 to store the process information 
322. 

[0057] Meanwhile, as the control signal 1 92 is transmitted across the control signal 

paths, delay is introduced by the IRs, interconnects, and OCDs of the paths. Waveform 
364 represents the skewed version of the control signal 192 at the output of the OCD 
1 18 (point 164), waveform 366 represents the skewed control signal at the inputs to 
the IRs 12 8, 1 36 (points 1 72, 1 70, respectively), and waveform 368 represents the 
skewed control signal as received by the memory matrix 1 30 (skewed control signal 
1 74) and the input sampling module 1 1 2 (skewed control signal 1 76). As with the 
clock signal, it is assumed in this example that the skews of the control signal paths 
are the substantially equivalent. 

[0058] 

Waveform 374 represents the data 331 -335 output in sequence by the memory 
device 1 04 as part of data signal 1 82 in response to the commands 31 1-316 
(waveform 368). In this example, it is assumed that the data is available from the 
memory device 1 04 one cycle after receipt of a read request (commands 3 1 1 -336). 
Waveform 376 represents the delayed data signal resulting from the OCD 122 (point 
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184), waveform 3 78 represents the delayed data signal at the input to the IR 126 
(point 186), and waveform 380 represents the skewed data signal 188 as received by 
the input sampling module 1 12. 

r 

[0059] Upon detecting the command 312 from the skewed control signal 176 at the input 

sampling module 1 1 2, the timing controller 21 2 asserts the read enable signal 1 78 
(waveform 370) during the next cycle of the skewed clock signal 190 (the third rising 
edge to the fourth rising edge) to direct the FIFO 1 1 0 to output the next set of process 
information (process information 322). In the following cycle of the skewed clock 
signal 1 90, the process information 322 is output to the information processing 
module 230 for use in processing the requested data associated with the command 
312 (data 3 32). 

[0060] At approximately the same time that command 31 2 is detected by the timing 

controller 21 2, the memory matrix 1 30 detects the command 31 2 from the skewed 
signal 1 74 and places the data 332 requested by command 31 2 on the data signal 
path during the following cycle of the skewed clock signal 1 58. The timing device 
212, expecting the data 332 associated with command 31 2 to be available at the 
fourth rising edge of the skewed clock signal 190, provides an enable signal to the flip 
flop 220 to sample the data signal 1 88 at the fourth rising edge of the skewed clock 
signal 1 58 to latch the data 332. By utilizing a skewed clock signal and a skewed 
control signal that is substantially similar to the skewed clock signal and skewed 
control signal, respectively, received at the memory matrix 130, the memory 
controller 102 can substantially compensate for the skew introduced by the signal 
paths. The resulting latched data 332 is then provided to the information processing 
module 230 for further processing. 

[0061] 

In the exemplary implementation, the process information 322 includes a 
destination address for the data 332 and the information processing module 230 is 
adapted to route the data 332 retrieved from the memory device 1 04 to its 
appropriate destination based on the corresponding destination address. When data 
332 and its associated destination address are received by the information processing 
module 230, the information processing module 230 determines the appropriate 
destination of the received data from the destination address and routes the data 332 
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to this destination as appropriate. Since, in at least one embodiment, the extraction of 
the destination address (as process information 322) and the sampling of the data 
332 are synchronized based at least in part on the skewed control signal 1 76, as well 
as the skewed clock signal 1 90, there typically is little difficulty in matching a 
destination address value to its corresponding data retrieved from the memory device 
1 04, and vice versa. 

[0062] As the timing diagram of Figure 3 illustrates, correctly sampling the skewed data 
v signal 1 1 8 typically would prove impossible in the absence of the skewed control 
signal 1 76 used by the memory controller 1 02 to simulate the operation of the 
memory device 1 04. To demonstrate, if the timing controller 21 2 were to utilize the 
original control signal 192 in conjunction with the skewed clock signal 190, the timing 
controller 21 2 would detect either command 31 3 or 31 4 at the third rising edge of the 
skewed clock signal rather than command 31 2. As a result, the timing controller 
would improperly associate the data 332 sampled at the fourth rising edge with the 
command 31 3 or 31 4, rather than properly associating the data 332 with the 
command 312. Likewise, without a means for storing the process information, such as 
FIFO 1 1 0, the memory controller 1 02 typically would be unable to properly associate 
process information 322 with the data 332. To illustrate, the process information 322 
is generated after the second rising edge of the original clock source 1 50 (waveform 
340), but the data 332 associated with the process information is not latched by the 
input sampling module 1 12 (waveform 382) until around the fifth rising edge of the 
original clock signal 1 50. Accordingly, without the FIFO 1 1 0 or other similar 
mechanism to store the process information 322 until the associated data 332 is 
latched, the information processing module 230 would be unable to properly 
associate the process information 322 with the latched data 332. 

Referring now to Figure 4, an exemplary implementation of the memory controller 
1 02 in a digital subscriber line (xDSL) modem 41 0 is illustrated in accordance with at 
least one embodiment of the present invention. As noted above, the present invention 
may be implemented in a number of different devices and systems that access data 
from a synchronous data source. Particularly, in one embodiment, the present 
invention can be implemented to access data from a synchronous dynamic random 
access memory (SDRAM) 404 for use by a communications processor 420 
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implemented by the DSL modem 41 0 for processing of data provided to, and received 
from, a network 470 (e.g., the Internet). The communications processor 420 can 
include any of a variety of communications processors, such as, for example, a 
communications processor available under the trade name Helium 200 from 
GlobespanVirata, Inc. of Red Bank, New Jersey. In the illustrated embodiment, data 
provided to the DSL modem 410 from one or more network devices 460 (e.g., a 
personal computer) is received by a network interface 440 (e.g., an Ethernet interface), 
transferred to the communication processor 420 and then stored in the SDRAM 404 
for subsequent retrieval by the communications processor 420. When the data is to be 
retrieved from the SDRAM 404 for processing by the communications processor 420, 
the communications processor 420 utilizes the memory controller 1 02 to 
synchronously retrieve the data from the SDRAM 404. As discussed above with 
reference to Figures 1 -3, the memory controller 1 02, in at least one embodiment, 
simulates the skew in the signal path 41 2 between the memory controller 1 02 and the 
SDRAM 404 to determine the appropriate timing for sampling of the data signal 
provided by the SDRAM 404. As such, the memory controller 1 02 can significantly 
improve the reliability of the sampled data without requiring the use of a slower 
system clock by the communications processor 420, thereby allowing the 
communications processor to operate at higher speeds. 

[0064] Other embodiments, uses, and advantages of the invention will be apparent to 

those skilled in the art from consideration of the specification and practice of the 
invention disclosed herein. The specification and drawings should be considered 
exemplary only, and the scope of the invention is accordingly intended to be limited 
only by the following claims and equivalents thereof. 
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